Calibrated Structured Prediction
نویسندگان
چکیده
In user-facing applications, displaying calibrated confidence measures— probabilities that correspond to true frequency—can be as important as obtaining high accuracy. We are interested in calibration for structured prediction problems such as speech recognition, optical character recognition, and medical diagnosis. Structured prediction presents new challenges for calibration: the output space is large, and users may issue many types of probability queries (e.g., marginals) on the structured output. We extend the notion of calibration so as to handle various subtleties pertaining to the structured setting, and then provide a simple recalibration method that trains a binary classifier to predict probabilities of interest. We explore a range of features appropriate for structured recalibration, and demonstrate their efficacy on three real-world datasets.
منابع مشابه
Consistency of structured output learning with missing labels
In this paper we study statistical consistency of partial losses suitable for learning structured output predictors from examples containing missing labels. We provide sufficient conditions on data generating distribution which admit to prove that the expected risk of the structured predictor learned by minimizing the partial loss converges to the optimal Bayes risk defined by an associated com...
متن کاملOn Structured Prediction Theory with Calibrated Convex Surrogate Losses
We provide novel theoretical insights on structured prediction in the context of efficient convex surrogate loss minimization with consistency guarantees. For any task loss, we construct a convex surrogate that can be optimized via stochastic gradient descent and we prove tight bounds on the so-called “calibration function” relating the excess surrogate risk to the actual risk. In contrast to p...
متن کاملValidation and calibration of the Kabi Pharmacia International Growth Study prediction model for children with idiopathic growth hormone deficiency.
In 1999 a model was published for prediction of growth in children with idiopathic GH deficiency (IGHD) during GH therapy, derived using data from the Kabi Pharmacia International Growth Study (KIGS) database (Pharmacia \|[amp ]\| Upjohn, Inc., International Growth Database). We validated and calibrated this KIGS model for growth in the first year of GH therapy using data from 136 Dutch childre...
متن کاملApplication of the GTN Model in Ductile Fracture Prediction of 7075-T651 Aluminum Alloy
In this paper the capability of Gurson-Tvergaard-Needleman (GTN) model in the prediction of ductile damage in 7075-T651 aluminum alloy is investigated. For this purpose, three types of specimens were tested: Standard tensile bars, Round notched bar (RNB) specimens and compact tension (C(T)) specimens. Standard tensile bar tests were used to obtain the mechanical properties of the material and t...
متن کاملCalibrated Prediction Intervals for Neural Network Regressors
Ongoing developments in neural network models are continually advancing the state-of-the-art in terms of system accuracy. However, the predicted labels should not be regarded as the only core output; also important is a well calibrated estimate of the prediction uncertainty. Such estimates and their calibration is critical in relation to robust handling of out of distribution events not observe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015